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Abstract 

Rank-order relational data, in which each actor ranks the others ac- 
cording to some criterion, often arise from sociometric measurements 
of judgment (e.g., self-reported interpersonal interaction) or preference 
(e.g., relative liking). We propose a class of exponential- family mod- 
els for rank-order relational data and derive a new class of sufficient 
statistics for such data, which assume no more than within-subject 
ordinal properties. Application of MCMC MLE to this family allows 
us to estimate effects for a variety of plausible mechanisms governing 
rank structure in cross-sectional context, and to model the evolution 
of such structures over time. We apply this framework to model the 
evolution of relative liking judgments in an acquaintance process, and 
to model recall of relative volume of interpersonal interaction among 
members of a technology education program. 

Keywords: ERGM; social networks; ranks; weighted networks; tran- 
sitivity; mutuality 



1 Introduction 



Rank-order sociometric data have a long history in the social sciences (New 



comb, 1961 Sampson 1968 Bernard and Killworth 1977). While many 
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network processes (e.g., diffusion, brokerage, exchange) are only sensibly 
posited for networks with categorical or ratio scale relationship states, many 
others — particularly those involving personal preferences (e.g., liking, advice- 
seeking) — are much more readily represented ordinally, and, indeed, may not 
even have interval, ratio, or categorical meaning. It is also common prac- 
tice in statistics to reduce data with unknown or problematic distributional 
form to ranks for nonparametric analyses such as Kruskal-Wallis and Mann- 
Whitney procedures, and similar use may be made of ranks in networks with 
valued ties as a compromise between analyzing tie values as they are and 
dichotomizing them (Newcomb 1961 pp. 32-34, for example). 

The most common approach taken to analyzing rank-valued network data 
in current practice is to dichotomize ranks into binary ties, defining a tie to 
be present if a given ego had ranked a given alter above a certain cut-off and 
absent otherwise. Cut-offs have been set at a particular rank (e.g., top 5) 



(Breiger, Boorman, and Arable 1975; White, Boorman, and Breiger 



Arable, Boorman, and Levitt[|1978[|Wasserman 



19801 IPattisolIt [1982] 



1976 



arris,! 



Florey, Tabor, Bearman, Jones, and Udry 2003 for example); at a particular 
quantile (e.g., top 50%) (Krackhardt and Handcock, 2007); or have been 
found adaptively (Doreian, Kapuscinski, Krackhardt, and Szczypula 1996). 
Another common approach is to focus on rank correlations and on treating 
ranks on an additive scale (Newcomb, 1956; Nakao and Romney, 1993). 

These approaches come with significant limitations. Dichotomizing ties 
requires a threshold point to be selected, inevitably discarding information 
and possibly introducing biases (Thomas and Blitzstein, 2011), while meth- 
ods like rank correlation are limited to simple comparisons and cannot, for 
example, be used to examine the strength of one social factor after controlling 
for the effects of another. 

Modeling frameworks explicitly designed for rank-order data would ad- 
dress these limitations, but to the authors' best knowledge, model-based 
approaches to rank network data as such have been very limited. Gormley] 



and Murphy (2007 2008), for instance, use a generalization of the Plackett- 



Luce model (Plackett, 1975) in a latent position framework to model what 
can be viewed as a bipartite rank-order network of affiliations from voters to 
candidates in Irish proportional representation through the single transfer- 
able vote elections. Null models for comparison of rank-order (or otherwise 
valued) data structures were developed by Hubert (1987), and model-based 
extensions of this approach for comparison of multiple structures have been 
introduced by Butts (2007b). (See also related work on null hypothesis test- 
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ing in a network regression context, e.g., that of Krackhardt (1987) and 



Dekker, Krackhardt, and Snijders (2007).) This latter work is focused on 



modehng degrees of correspondence between relational structures, and does 
not attempt to model the internal properties of rank-valued networks them- 
selves. This second problem is the focus of the present paper. 

For modeling of internal network structure, exponential-family random 



graph (ERG) or p* models ( 


Holland and Leinhardt 


1981 


Pattison, 1996 Robins, Pattison, and Wasserman 




1999 


) 



Wasserman and 



favored approach. Models parameterized in this way have been applied to 
social network data in a variety of contexts, including dichotomized rank- 



order data ( 


Krackhardt and Handcock, 


2007; 


Goodreau, Kitts, and Morris 


2008b 


) . Recently, 


Krivitsky 


(2012 


) formulated a generalized framework for 



exponential-family models on networks whose ties have values. In this work, 
we develop ERG models for ordinal relational data, focusing on the case of 
forced complete orderings, but with an eye toward more general ordinal data. 
In Section [2| we discuss representation of ordinal relational data and intro- 
duce the probabilistic framework for exponential-family models for them, 
and in Section |3} we describe statistics that could be used to model com- 
mon network properties within this framework. We discuss issues involved in 
implementation and statistical inference on these models in Section |4} Two 
applications of this framework are demonstrated in Section |5} 



2 Exponential- Family Framework for Ordinal 
Relational Data 

We begin this section by defining notation for representation of ordinal re- 
lational data and by establishing basic principles for using such data in a 
manner that respects its intrinsic measurement properties. Throughout this 
article, we will use "A^^" to refer to what could be called a "distinct Carte- 
sian power" of a set A: the set of distinct p-tuples of its elements. Formally, 

= {veAP: V,e{i..p}Vje{i..p}i =^ Vi ^ Vj}, 

where A^ is the pth Cartesian power of A. 

A rank- valued network consists of a set of actors, of cardinality n = \N\, 
together with a dyadic adjacency structure y whose i,j pairs reflect the rank 
of (alter) j with respect to (ego) i. Formally, y is a mapping from the set of 
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dyads (distinct ordered pairs of actors) Y = A^^^, to a set S for which there 
exists some binary operator, > expressing the ordinal sense of §. In general, 
we make few assumptions regarding >, using here only the constraint that 
for all {i,j,k) e A^^^, either y^j > y^^^ or y^^ i> y^^^. Importantly, y^^^ are 
not assumed to be comparable across egos: for each ego, we may only say 
that one alter is ">" another (or that this does not hold). The relation ">" 
could represent "preferred to", "interacted more with than", "judged to be 
taller than", or any other judgment of interest. 

With the additional assumption of transitivity, the above formulation 
leads to (y, >) representing a partial ordering of alters by each ego. Finally, 
a further constraint on > that ~'(?/i.j > yi^k) =^ yi,k > Vij results in 
a complete ordering, which may arise when an ego is forced to rank all of 
the alters with no ties permitted. This is an important special case, and we 
consider it here in more detail. 



Krivitsky (2012) suggests that a sample space of complete rankings of 
every actor in a network by every other actor can be represented by a di- 
rected network with no self- loops (Y = A^^^) and space of dyad values 
§ = {1 ..n — 1}, with the ranked nature of the data leading to a complex 
constraint structure for 3^: 

y = {y'ES'^: V,e;vV.es3!,e;v\{.}?/-,- = r}. (1) 

That is, each ego i assigns a unique ranking to each alter j. For convenience, 
the lowest rank is coded as 1 and the highest as n — 1. This formulation is 
slightly misleading in that elements of S have only ordinal and not interval 
or ratio meanings, and, as we noted above, are only ordered within the 
rankings of a given ego. That is, it makes sense to ask if for some y E y 
and {i,j,k) G A^^^, y^j > y^^/. — whether i had ranked j over k, but not 
to evaluate the difference between ranks {y^ j — y^ j.) or to compare ranks 
by different egos (y^- j > j/^ J. It does, however, represent distinct complete 
rankings in a concise and convenient manner, so we make use of it, with 
the proviso that the statistics evaluated on y make use of no operation 
other than comparison within an ego's rankings. To concisely represent this 
fundamental operation, we define an indicator 

y..>.-l' y^^^>y^^. (2) 

I otherwise 

Taking the set defined by ([T]) as our sample space, we can formulate an 
exponential family for (complete) rank-order networks by defining sufficient 
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statistics g : y ^ MP and parametrizing it by G (0 C M'?) which are 
mapped to canonical parameters via mapping 77 : — > W: 

rr0;,,,g(r -y) — , [6) 

with the normahzing factor 

«^r,,.W = 5^exp(77(0)-^(t/)). (4) 

The sufficient statistics g may also implicitly incorporate exogenous covari- 
ates X. For the purposes of this paper, we assume these to be fixed and 
known, and we further limit our development to linear exponential families, 
in which r]{0) = 6. 



3 Terms and Parameters for Ordinal Rela- 
tional Data 

We now introduce and discuss a variety of sufficient statistics g{-) for the 
model of ([s]) ("model terms") that abide by the restrictions discussed in 
Section |2] while viably representing phenomena frequently observed in social 
networks. For each term, we discuss how it may be interpreted, without 
assuming rankings to have more than ordinal meaning. 



3.1 "Promotion" statistics and interpretation 

For binary ERG Ms, Snijders, Pattison, Robins, and Handcock|(2006), Hunter 



Handcock, Butts, Goodreau, and Morris (2008b), and others have used 
change statistics, the effect on the model of a toggling of a tie — an atomic 
change in the binary network structure — to aid in interpreting the model 
terms. For complete ordering data, where changing the ranking of one in- 
dividual necessarily changes ranking of others, a candidate for an analogous 
atomic change must affect positions of as few actors as possible. Such a 
change is the swapping of rankings by two adjacently-ranked alters by a sin- 
gle ego. We thus use the effect of having ego i "promote" a promotable alter 
iE{kEN: k^i/\y^f.<n — 1}, swapping j's rank with that of the alter 
previously ranked immediately above j, as such a change. 
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Let 2/i,(r) be the alter j such that y^j — r, and let y^'^^^ be network y 
where i had swapped rankings of j and k. We define a promotion statistic as 

A{^g{y)^g{y'--''^^^-^yi^^^^^)-g{y). 

For the sake of readability, we will use as shorthand for 7/^ — the 
alter of i immediately above j. 

This quantity emerges when considering the effect of the promotion on the 
odds of observing a modified network (relative to some reference structure) 
under the model: 



That is, holding the rest of the network fixed, ego i is exp(r7(0) ■ A<^gr(?/)) 
times more likely to rank j one rank higher than j is ranked by i in than 
not, according to the model. Note that promotion statistics are mainly useful 
for complete orderings: if the ordering is partial, it is possible for i to promote 
j without demoting j+. 



3.2 Exogenous factors 

We begin our quorum of substantively useful statistics by considering "ex- 
ogenous" factors: those factors that would influence rankings by an ego i 
in a manner that is independent (in the probabilistic sense) of the rankings 
of all other egos i' G A^\{i}. Substantively, these factors are exogenous to 
the ranking process in that they arc not, at least on the time scale of the 
process, mutable, or in that they operate independently of an ego i being 
able to observe or infer rankings or other salient states or actions (that are 
endogenous to the model) of any other ego i'. 

3.2.1 Attractiveness/Populeirity Effects 

For assessments of attractiveness, liking, and status, it is likely that egos' 
rankings will be influenced by some relatively stable (and exogenous) ten- 
dencies of particular alters to rate more highly than others. For instance. 
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assessments of physical attractiveness tend to be broadly consistent within a 
given cultural context, and such assessments correlate positively with phys- 
ical attributes and performance characteristics (e.g., subtleties of dress and 



speech) that are usually difficult to alter over short time scales (Morse, Reis, 
[Gruzen, and Wolff, 1974 Webster and Driskell 1983). Thus, we have the 



emic notion that some persons "are attractive" , with the attribution regarded 
as an fixed trait of the person being assessed; while the reality is less triv- 
ial, stable factors governing attractiveness are sufficiently important that we 
may wish to capture them where possible. In other settings, institutionalized 
status characteristics (e.g., group membership, formal social roles) or the like 



Norman, and Zelditch 1977) 



may have similar effects (Berger, Cohen, and Zelditch, 1972; Berger, Fisek 



Regardless of source, we can treat these effects directly by positing some 
covariate £C G M", associated with a statistic of the general form 



9a 



ttract. 



{y;x)= yi:j>kixj 



Xk 



This statistic simply indexes the tendency for those with higher values on x 
to rank more highly than those with lower values. The promotion statistic 
associated with the above is of the form 



ttract 



2{xj 



x^ 



i.e., twice the difference between the attractiveness of j and the actor over 
whom j may be promoted. Thus, the effect of this term on the odds of i 
promoting j is exp(277Attract.(^)(aJi - As expressed, f/ Attract, treats x 

as at least an interval scale; in other cases, the subtraction operator would 
need to be replaced with a more appropriate function. While x may be an 
observed covariate, it is worth noting that this quantity is also a natural 



candidate for treatment via a random popularity effect (van Duijn, Snijders, 



and Zijlstra 2004) 



3.2.2 Difference/Similarity Effects 

Just as one may posit differential tendency to "win" ranking contests overall, 
one may also posit that each actor i has exogenous characteristic Xi G X 
such that alters "close to" or "far from" ego will be more likely to be highly 
ranked than those with the reverse attributes. This is a familiar application 
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of homophily/heterophily to the rank order case, and the implementation is 
straightforward: 

fl'HomophX?/) ^) = yv.j>ki^i^i^^j) ~ ^i^U^k)) , (5) 

{i,j,k)(^N^^ 

where z : — M is any function that is monotone increasing in the difference 
between its arguments. Thus, where this statistic is enhanced we expect "far" 
actors to outrank "near" ones (from the point of actor i), with the reverse 
holding where this statistic is suppressed. The atomic effect of this term is 
simply 

^(j9Homoph.{y, X) =2 {z{Xi, Xj) - z{Xi, Xj+)) . 

As with attractiveness, difference effects can be based either on observed 
covariates or on latent quantities. 



3.2.3 Dyadic covariates 

We can extend the above logic to general dyadic covariates. For instance, 
we may consider a case in which a within-context ranking is made by ac- 
tors having ongoing social relationships; we might expect, then, that actors 
engaged in positive long-term relationships would tend to give preference to 
their partners within the specific rating context. Statistics for this behavior 
can be produced like so: 

fl'Dyad(?/! ^) = Vi: j>k i^ij ~ Xi,k) 

{i,j,k)&N'-i^ 

^(j9Y)y^A{y^ X)=2 [x,j - Xij+) . 

Of course, the cases of attractiveness and difference described above are 
simply special cases of dyadic covariates, with particular structure imposed. 



(Notably, the matrix permutation family (ERGP) of Butts (2007a) has a 
somewhat similar structure.) 



3.2.4 Comparison covariates 

Finally, in the framework of pairwise comparison, the most general exogenous 
covariate form assigns a weight to each pairwise comparison by each ego: 

fl'Pairwised/) ~ ^ ] Vi: j>k'^i,j,ki 

{i,j,k)&N37t 
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for some x G M , with 

This statistic has all other exogenous statistics as special cases. 



3.3 Endogenous factors 

We now turn to factors that are endogenous, in the sense that, unlike exoge- 
nous factors, their effect on the rankings by ego i does depend on rankings 
by other egos i' G A^\{i}. Substantively, these are factors for phenomena 
that may plausibly arise in cases for which ego observes or is able to infer 
the rankings of others. 



3.3.1 Global conformity 

In many settings where an ego is able to observe or infer the rankings of 
others, there is reason to presume that this will influence ego, so that he 
or she will tend to bring his or her own rankings into conformance with 
others'. This is certainly true in dominance or status rankings, where there 
is considerable evidence that individuals can and do infer status ordering from 



observation of third-party judgments (see, e.g., Anderson, Srivastava, Beer, 



Spataro, and Chatman, 2006); this synchronization may even be explicit, 
as in certain types of gossip (wherein two or more parties "compare notes" 
on the relative status of their peers). The status of influence for relations 
such as relative liking is less clear, but still plausible: ego may take alter's 
evaluations of the relative merits of other alters into account in assessing his 
or her own preferences, just one can be influenced in one's judgment of the 
merits of food, art, or other experiential goods by the evaluations of others. 
Finally, the mutual observability of rankings may produce in some settings 



a form of "conformity pressure" (Asch, 1951, for example), such that those 



displaying deviant rankings anticipate (and are possibly exposed to) sanction. 
The importance of influence processes in such settings is well-documented. 

To formalize influence in the ranking context, we must note that four 
elements are involved: ego's assessment of two alters (say, j and /c), and the 
assessment of those same alters by a distinct third party (say, / ^ {hj-, k}). 
Denoting ego by i, we note that ego's assessment of j and k is in conformity 
with /'s assessment of j and k when yi,jyk = Vi:j>k Vi-.kyj = Uukyj- ^ 
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natural statistic to summarize the degree of ratings nonconformity, then, is 



9GNc(y) = Yl [yi:j>k (i - yi:j>k) + (i - yi:j>k) yi:j>k] ■ (6) 

{i,j,k,i)£m^ 

The promotion statistic for nonconformity can be derived by observing that 
when i promotes j over j+, the statistic is incremented by 2 for every other 
ego / who has j+ ranked over j and decremented by 2 for every / who has j 
ranked over j+. Thus, 

l(iN\{i,j,j+} l(^N\{i,j,j+} 

a "vote" among the Is as to the relative ranking of j and j+. Insofar as 
influence is active, Qq^q should be suppressed (and the associated parameter 
negative). In the typical case of total ordering within subjects, sufficiently 
strong suppression of gfcNC ^i^^ force the population to converge to a universal 
consensus ranking; if the suppression is weaker, a looser but analogous set of 
states will be favored. 

It should be noted that this form of influence and the random attractive- 



ness effect, mentioned in Section 3.2.1 can both explain the same network 



feature: both heterogeneity in attractiveness and social influence induce an 
agreement in rankings, and the latter may be considered a marginal represen- 
tation of the former, in a manner similar to that of a within-group correlation 
as a marginal reflection of a (conditional) random effects linear model. 



3.3.2 Local conformity 

For global nonconformity gfcNC' promotion statistic ([T]) implies that i 
weights agreement with every other / equally, regardless of how i had ranked 
/. In some cases, it may be plausible that the salience of / for i may depend 
upon i's ranking of him or her; for instance, i may be more likely to attend 
to (and to conform to) those whom he or she ranks highly than those whom 
he or she ranks lower. A possible formalization of this is the notion that 
i's ranking of j would be influenced by / only if i ranks I over j, so only 
actors ranked above j influence z's ranking of j. As with global conformity, 
we define this statistic, the local nonconformity, negatively: counting the 
number of instances where an ego had ranked / over j but then did not 
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conform to /'s ranking of j relative to some fourth alter k: 



9LNciy)= Yl Vv.i>jyi:j>k{^-yr.j>k)- 

{i,j,k,l)&N^7^ 

The atomic effects for this statistic are somewhat complex: 

^ij-fl'LNcd/) = ( + yi:k>j+yk:j+>j ~ y i: k>jy k: j>j^ 

k(^N\{i,j,j+} 



+ yk:i>jyk: 



k:j+>j yk:i>j+yk:j>j 



y j:j+>kyi:k>j+ y j+ : j>ky i: k>j) 



(8) 



(9a) 
(9b) 
(9c) 



They have, however, a meaningful interpretation. The two terms (9a) repre- 
sent the effect of i bringing his or her ordering of j and into conformance 
(or disconformance) with those of some actor k whom i had ranked higher 



than at least one of them. The pair (9b) represents the situation where some 
actor k ranks i over j and/or j+, so i promoting j over j+ either creates or 



eliminates nonconformity on the part of k. The pair (9c) represent the notion 



that nonconformity is also created if actors i and j disagree on the ordering 
of and some actor fc, so i promoting j over j+ creates nonconformity by 
making j's ordering of j+ salient to i. (Ego i can resolve this tension either 
by changing the ranking of j+ and k to conform with j (affecting (9a)) or by 
demoting j to make his or her ranking less salient.) 



3.3.3 Deference aversion 

Influence, as defined above, deals with the mutual adjustment among raters 
regarding their relative assessments of third parties. When ego is a party 
to the rating in question, the situation becomes more complex. By assump- 
tion, ego does not explicitly self-rate; thus, ego cannot adjust towards alter's 
impression of him or her. In many settings, however, another mechanism 
may be active that will make alter's ranking of ego salient for ego's rank- 
ing of alter. In particular, consider the case in which higher rankings are 
associated with positive evaluation, such that being ranked below others is 
aversive. Moreover, let us assume that ego infers his or her own status via an 
implicit transitivity mechanism, such that if alter / ranks j above ego {i) and 
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ego ranks / above j, then ego is for social purposes ranking him- or herself 
below /. Under such circumstances, deference aversion may lead ego to resist 
ranking / above j. 

To capture this notion with a statistic, we propose the following: 



We expect this statistic to be suppressed where deference aversion is present. 
The promotion statistic is incremented if j+ had ranked i over j, since it 
creates a deference of j+ to i via j; or if j had ranked j'^ over i since it 
creates a deference of i to j via k; and it is decremented if j+ had ranked j 
over i, as it would eliminate the deference of i to j+ via j; or if j had ranked 
i over as it would eliminate the deference of j to i via Thus, 



It is interesting to note that the principal effect of suppressing this statistic 
is actually to bring ego's rankings in line with those of alter, somewhat akin 
to the reciprocity or mutuality in binary relations. Specifically, if there are r 
persons ranked by alter as being above ego, then ego will also tend to rank 
those same r persons as being above alter. Where a total order is present, 
ego and alter will thus tend to give each other the same rank (and, indeed, 
to agree on those persons having higher ranks). Of course, applying this 
logic to all pairs suggests pressure towards equality, which is impossible to 
achieve in the total order case (but not necessarily for others). Even in the 
case of total orders, however, considerable variation in fl'Defer. possible, with 
lower values indicating rating structures in which agreement between raters 
on high-ranked alters is maximized. 

3.4 Consistency across settings 

When ranking the same alters among multiple settings — across time or across 
rubrics — there is reason to expect that ego will tend to exhibit consistency 
in alter ratings. Across time, this is an exogenous effect, because earlier 
rankings cannot be influenced by later rankings. Across rubrics, it may be 
endogenous. Here, we assume two rating structures, y and y', on vertex sets 
and A^', such that some set = N n N' of actors are involved in both 
networks. For convenience in notation, we take the labeling of the members 





(10) 



12 



of Ns to be the same in both and A^'. Given this, our statistic measuring 
inconsistency is simply 

QlnconiV. y') = Yl {yi:j>k " V'r.jyk) + (l " Vr.jyk) V'i-.jyk) , (H) 

with promotion statistic being simply 

^{j9incon.{y; y') = '^iy'i:j+>j - y'r.j>j+)- 

As fliincon. measures the discordant pairs of rankings in y versus y', suppress- 
ing it implies higher levels of cross-context consistency. 



The statistic ( 11 ) treats all disagreements between y and y' as equivalent. 
It may be the case, however, that only some disagreements are of interest, or 
disagreements themselves need to be modeled. This can be facilitated by a 
more general form of gfincon.- Given weights x E M^^^, symmetric such that 
Xij^k = Xi,k,j, let the weighted inconsistency 

fflncon.(2/; y', X)= {yi-j>k (1 - y'i:j>k) + (1 - yt:j>k) yr.j>k) Xi,j,k- 

{i,j,k)&N!^ 

(12) 

X can, itself, be parametrized to model factors affecting disagreement be- 
tween two rankings. 



4 Inference and implementation 

The proposed class of models is a finite exponential family over a finite sample 
space. Thus, at least in the non-curved case with t]{0) = 0, its natural 
parameter space ©n = {0' : Kr,^g{0') < oo} = W^, an open set, so the models 
from this family are regular and posses all of the inferential properties of 



such families. (Brown, 1986, pp. 1-2) 

We briefly discuss practical issues associated with fitting the models of 
this class. 

4.1 Implementation 

The sample space of a complete ranking ERGM is finite but large {\y\ = 
{{n — 1)!)"), so even for networks of modest size, evaluating the normalizing 
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constant ^ is not computationally feasible. However, provided a method 
for sampling from this ERGM distribution FT0.rj^g{-) exists, the MCMC MLE 
method can be used to fit the model. (Geyer and Thompson 1992; Hunter 



and Handcock 2006 Krivitsky 2012) We describe a simple algorithm for 



sampling from an ERGM for complete ordering defined in Section [2] in the 
Appendix. 

We base our implementation on the extensions of Krivitsky (|2012 ) to 
the R package ergm for fitting binary ERGMs. (Handcock, Hunter, Butts, 



Goodreau, Krivitsky, and Morris, 2012), using the valued representation de 



scribed in Section |2] and Algorithm [T] to draw samples for the MCMC MLE. 
We expect to incorporate this implementation into a future pubhc release. 
Lacking a general procedure such as the maximum pseudo-likelihood es- 



timation (MPLE) (Strauss and Ikeda, 1990) to use as a starting point for 
optimization, we initiated the MCMC MLE optimization at = in each 
of our applications. This parameter configuration corresponds to a uniform 
distribution on the space of possible rankings. 



4.2 Degeneracy 

Degeneracy, in its many meanings, is often a concern in applying ERGMs. 



Despite recent progress (Rinaldo, Fienberg, and Zhou 2009 Butts 2010 



Schweinberger 2011, for example), few general and unambiguous diagnos- 



tics for degeneracy exist beyond simulated goodness-of-fit measures (Hunter, 



Goodreau, and Handcock, 2008a) and diagnostics of symptoms such as poor 



convergence of the MLE-finding procedure and, in particular, poor mixing 



of the MCMC MLE algorithm (Goodreau, Handcock, Hunter, Butts, and 



Morris, 2008a), often due to multimodality of the distribution of networks 



under the model. (Snijders et al. 2006 Rinaldo et al. 2009; Krivitsky 2012) 



In our case, there is some cause for concern due to the high order of some 
of the sufficient statistics we propose. For example, gfcNC a summation of 
indicators over a set of cardinality |A^^^| = 0{n^). Its promotion statistic 
^ij-fl'GNC is ^ ^ set of 0{n). Thus, there may exist configurations 

for which a small change in the ranking leads to a massive change in the 
value of this statistic and thus in the likelihood. Such statistics can induce 
excessively strong dependence in many cases lead to asymptotic degeneracy 



(Butts, 2010 Schweinberger 2011). 



At the same time, a number of factors is likely to ameliorate such prob- 
lems for the specific case of rank data. A network of ranks simply contains 
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more information than a network of binary outcomes of the same size. More 
concretely, a uniform distribution over binary directed networks of size n 
with no self- loops has entropy of n(ra — 1) bits, while a uniform distribution 
over rank networks has entropy of log2 ((n — 1)!)" = ri log2(?T- — 1)! bits, which 
grows much faster in n. This can reduce the risk of pathological model be- 
havior due to poor identification in the small-n case. Also, while there exists 
the potential for problematic statistics and configurations, the constrained 
nature of the sample space makes degeneracy more difficult to achieve than 
is the case for typical graphs or digraphs: an ego cannot promote one alter 
without demoting another, and while some statistics, like Qq^q-, have an ex- 
treme case of all actors agreeing, others, like g-^eier.i cannot be reduced to 
0. 

In all of the following examples, we found no symptoms of degeneracy: all 
estimators converged without issues to estimates for which the expectations 
of sufficient statistics matched their observed values, and MCMC diagnostics 
showed no sign of poor mixing or multimodality. That these MLEs were 
found and converged to despite the starting point, again, suggests that 
models using the terms introduced here are quite robust. 



5 Examples 

5.1 Dynamics of acquaintance process 



From 1953 to 1956, a research group led by Theodore Newcomb (1961) con- 
ducted an experimental study of acquaintance and friendship formation. In 
each of the two study years, 17 men attending the University of Michigan — 
all transfer students with no prior acquaintance among them — were recruited 
to live in off-campus fraternity-style housing. Demographic, attitudinal, and 
sociometric information was collected about the subjects. In particular, in 
the second year, at each of 15 weekly time points (with week 9 being miss- 
ing), each participant was asked to rate all other participants on "favorable- 
ness of feeling", with ratings forced to be distinct and converted to ranks. 
(1961, pp. 32-34) These data represent an example of longitudinal data of 



complete ranks, and, particularly the data from the second year of the study, 



have been used to study the formation of interpersonal relationships by New- 



comb 


(1956|), 


Breiger et al. 


( 


1975 


), 


White et al. 


(1976 


), 


Arable et al. 


(1978 


), 


Wasserman ( 


1980 


), 


PattisoE 


( 


1982 


), 


Nakao and Romney 


( 


1993 


), I 


Doreian 
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et al. (1996), Krackhardt and Handcock (2007), and many others. 



We use ERGMs for ordinal data to study this network, examining the 
social forces relevant to its structure and its evolution over time. We take 
two distinct approaches: cross-sectional, where each time point's network 
structure is modeled on its own and dynamic, where each time point but 
the first is effectively modeled as a change from the previous time point's 
rankings. 



5.1.1 Cross-sectional analysis 



Demographic data, including age, religion, and political views of the subjects 
were gathered. Also, within the house, the subjects were assigned to rooms, 
spread over two floors of the house — three one-occupant rooms, four two- 
occupant, and two three-occupant rooms. (Newcomb, 1961, pp. 67-68) 
Furthermore, while some were assigned to rooms at random 



pp. 

others were 



assigned with an aim to maximize (for some rooms) and minimize (for other 



rooms) the roommates' compatibility as understood by the researchers (1961 



pp. 216-220). All of these factors could be used as predictors in our modeling 
framework via terms introduced in Section|3.2l However, these measurements 



do not appear to be publicly available, so we focus on the endogenous effects. 



although the "birds of a feather or friend of a friend" ( Goodreau et al. , 2008b ) 



caveat applies. For each of the 15 networks, we model deference aversion (via 



fl'DcfcrA^ ([10])) and glo bal (v ia Qg^qO^ ) ([6])) and local (via silncI _ 
conformity. Per Section 3.3. 1[ the suppressing of the global nonconformity 
statistic produces the same effect as latent actor attractiveness, so it may, in 
this case, be viewed as modeling latent heterogeneity in popularity. 

We report the maximum likelihood estimates for each of the terms over 
time in Table [T] and plot them in Figure [Tj Deference aversion is significant 
(the coefficient on the deference statistic is negative) throughout the evolu- 
tion of rankings, starting with the first point of observation. This is consistent 



with the finding of Newcomb (1956), Doreian et al. (1996), and others that 



"friendships are reciprocated immediately" . Our analysis, however, suggests 
deepening deference aversion over time, not reaching its ultimate magnitude 
until week 4 or 7. (Informally, the estimated Kendall's rank correlation be- 
tween the parameter estimate and week number is f = —0.43, significant 
with P-value = 0.028.) One explanation for this difference is that prior 
analyses, including that of Doreian et al. (1996), dichotomized the dyads in 



the network. Our approach uses the entire ranking and may thus be more 
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Table 1: Results for cross-sectional analysis of the Newcomb's data 

Estimates (Std. Errors) 

Nonconformity 

Week Deference Global Local 






-0 


1539 


(0 


0398) 


-0 


0020 


(0.0016) 


-0.0098 


(0.0100) 


1 


-0 


2185 


(0 


0463) 


-0 


0005 


(0.0016) 


-0.0198 


(0.0092) 


2 


-0 


2203 


(0 


0471) 





0012 


(0.0016) 


-0.0304 


(0.0087) 


3 


-0 


2129 


(0 


0467) 





0019 


(0.0015) 


-0.0362 


(0.0080) 


4 


-0 


2866 


(0 


0586) 





0009 


(0.0014) 


-0.0353 


(0.0079) 


5 


-0 


2505 


(0 


0587) 





0006 


(0.0014) 


-0.0400 


(0.0077) 


6 


-0 


2418 


(0 


0574) 


-0 


0002 


(0.0014) 


-0.0357 


(0.0081) 


7 


-0 


4004 


(0 


0822) 





0016 


(0.0012) 


-0.0446 


(0.0073) 


8 


-0 


3791 


(0 


0744) 





0006 


(0.0013) 


-0.0368 


(0.0075) 


10 


-0 


3118 


(0 


0708) 





0014 


(0.0012) 


-0.0455 


(0.0073) 


11 


-0 


2553 


(0 


0593) 





0013 


(0.0013) 


-0.0454 


(0.0077) 


12 


-0 


3017 


(0 


0661) 


-0 


0001 


(0.0013) 


-0.0354 


(0.0078) 


13 


-0 


1758 


(0 


0508) 





0012 


(0.0014) 


-0.0462 


(0.0080) 


14 


-0 


3798 


(0 


0794) 





0010 


(0.0013) 


-0.0439 


(0.0075) 


15 


-0 


3364 


(0 


0746) 





0005 


(0.0013) 


-0.0434 


(0.0076) 



Coefficients statistically significant at a = 0.05 are bolded. 

Standard errors incorporate the uncertainty introduced by approximating of the likelihood 
using MCMC ([Hunter and Handcock[ |2006[). 
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sensitive. 

The local nonconformity term is also significant for all but the first obser- 
vation point, although its effects seem to emerge more gradually than those 
of deference aversion (Kendall's f = —0.58, P- value = 0.002), leveling off 
around Week 7 or perhaps even later. This does agree with|D oreian et al. 



(1996), although others have suggested earlier and later times when the net- 
work stabilized. 

In the presence of the local nonconformity term, the global nonconfor- 
mity term is not significant: there does not appear to be a significant overall 
consensus in ratings, although Newcomb (1956) reports that three specific 
subjects were generally disliked by everyone, including each other, so perhaps 
failing to detect this is a result of lack of power and presence of the local non- 
conformity term. Notably, the estimated correlation between the local and 
global nonconformity parameters (within each week's fit) is strongly nega- 
tive and consistent (—0.87 — 0.79). This suggests that these terms explain 
similar behavior, though, significance of one and not the other indicates that 
conformity is primarily to those more liked. 



5.1.2 Dynamic analysis 



We now turn to modeling the evolution of the rankings over time. For our 
dynamic analysis, we use a simple Markov formulation similar to those of 



Krackhardt and Handcock (2007) and Hanneke, Fu, and Xing (2010) for the 



evolution of rankings over time: 



having the normalizing constant 



y 



exp{r){e) ■ g{y*;y^ )) 



;0,?/*-^) = 5^exp {vi0)-9iy';y'-')). 

y'ey 



For each of the 14 transitions between successive networks, in addition to the 
three terms used in the cross-sectional analysis, we model inconsistency over 
time via gi^^„^{Y^;Y^^^) (11). This term effectively absorbs the structure 
in the network at t that is due to inertia and to social forces operating in the 
time periods prior to t — 1, and thus the other terms model the social forces 
affecting only the changes in the rankings over time. 
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Table 2: Results for dynamic analysis of the Newcomb's data 



Estimates (Std. Errors) 
Inconsistency Nonconformity 
Week with Prior Week Deference Global Local 



1 


-0 


1341 


(0 


0149) 


-0.1896 


(0 


0474) 


0.0003 


(0 


0017) 


-0.0190 


(0 


0094) 


2 


-0 


2116 


(0 


0181) 


-0.1663 


(0 


0522) 


0.0022 


(0 


0020) 


-0.0311 


(0 


0099) 


3 


-0 


2437 


(0 


0215) 


-0.1258 


(0 


0541) 


0.0009 


(0 


0021) 


-0.0291 


(0 


0104) 


4 


-0 


2802 


(0 


0236) 


-0.2057 


(0 


0686) 


-0.0037 


(0 


0018) 


-0.0261 


(0 


0092) 


5 


-0 


2946 


(0 


0252) 


-0.1309 


(0 


0659) 


-0.0025 


(0 


0022) 


-0.0303 


(0 


0102) 


6 


-0 


3458 


(0 


0274) 


-0.1251 


(0 


0690) 


-0.0014 


(0 


0025) 


-0.0164 


(0 


0111) 


7 


-0 


3902 


(0 


0332) 


-0.3906 


(0 


0989) 


0.0040 


(0 


0026) 


-0.0457 


(0 


0115) 


8 


-0 


3075 


(0 


0267) 


-0.2246 


(0 


0847) 


-0.0009 


(0 


0024) 


-0.0138 


(0 


0109) 


10 


-0 


3004 


(0 


0247) 


-0.1607 


(0 


0785) 


-0.0001 


(0 


0023) 


-0.0346 


(0 


0107) 


11 


-0 


2891 


(0 


0255) 


-0.1247 


(0 


0667) 


-0.0003 


(0 


0022) 


-0.0247 


(0 


0101) 


12 


-0 


3259 


(0 


0277) 


-0.2147 


(0 


0762) 


-0.0032 


(0 


0024) 


-0.0087 


(0 


0112) 


13 


-0 


3729 


(0 


0316) 


-0.0178 


(0 


0654) 


0.0030 


(0 


0027) 


-0.0403 


(0 


0125) 


14 


-0 


3459 


(0 


0302) 


-0.3406 


(0 


0896) 


-0.0015 


(0 


0024) 


-0.0305 


(0 


0108) 


15 


-0 


3089 


(0 


0280) 


-0.1947 


(0 


0839) 


-0.0022 


(0 


0023) 


-0.0246 


(0 


0108) 



CoefRcients statistically significant at a = 0.05 are bolded. 

Standard errors incorporate the uncertainty introduced by approximating of the likelihood 
using MCMC (iHunter and Handcockl 12006 ). 



Because we seek to examine the strengths of the factors over time, we 
use time- varying parameters (although Krivitsky and Handcock (2010) show 
that the approach of Hunter and Handcock (2006) can be applied to series of 
networks or transitions as well). Week 9 rankings were not reported. Because 
of this, for Week 10, we fit the parameters for transition from Week 8. 

The maximum likelihood estimates for each transition are reported in 
Table |2] and visualized in Figure |2j The estimates for the transition from 
Week 8 to Week 10 do not appear to be qualitatively different from those 
from nearby transitions. In particular, inconsistency does not appear to be 
higher over this particular two-week period. 

The clear downward trend (f = —0.56, P-value = 0.005) in inconsistency 
over successive weeks suggests that the rankings are initially in flux as the 
acquaintance process takes place, solidifying over time. As before, global 
nonconformity is not a significant factor. The parameter estimates for the 
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other two factors still appear to be, on the whole, significant, but are uni- 
formly smaller in magnitude compared to those of the corresponding weeks 
in the cross-sectional analysis and are less precisely estimated (as represented 
by uniformly greater standard errors). This is because rather than embody- 
ing the structure of the whole network, they embody only the structure of 
changes in the network over the week, and information to infer their strength 
is drawn only from those changes. This means that some "instant" social ef- 
fects such as friendship reciprocation have been absorbed into the Week 
observation, which is not modeled in the dynamic analysis. 

In contrast to the cross-sectional analysis, neither deference nor local 
nonconformity appear to have a significant monotone trend over time (both 
correlations have P- value > 0.5). That is, while they are time- varying when 
viewed cross-sectionally, the effects of these social forces over and above in- 
ertia are fairly consistent over time, at least after the initial time point. This 
suggests that this modeling approach may be successfully isolating social 
forces as they affect actors' behavior over time from the effects of preexisting 
configurations. 



5.2 Informant accuracy 



In the late 1970s, Bernard, Killworth, Kronenfeld, and Sailer (1984) con- 



ducted a series of studies to assess the accuracy of retrospective sociometric 
surveys of several types. In each study, respondents in a social network — 
deaf teletype users; amateur radio operators; office workers at a firm; stu- 
dents in a fraternity; and faculty, graduate students, and staff in an academic 
program — had their social interactions observed or recorded and were asked, 
in retrospect, to indicate others in their network with whom they interacted, 
allowing recalled and observed network structures to be compared. In the 
latter study — conducted in a graduate program in technology education at 
West Virginia University — the 34 subjects had the frequency of their inter- 
actions recorded by a team of observers over the course of a week, and then 
each subject was asked to provide a complete ranking of the other subjects 



on "most to least communication that week" (Bernard and Killworth, 1977). 



This produces a complete ranking, suitable for analysis using our methods. 



20 



5.2.1 Modeling inconsistency 



In this apphcation, we use models with sufficient statistics of the form (11) 



and (12) to assess factors that appear to affect accuracy of rankings. Let 



y & y he the reported rankings; and let y'^^ e Nq be a weighted symmetric 
graph of the observed frequencies of interaction, so that yfj is the number of 
times i and j were observed interacting. Like y^~^ in the previous example, 
it is exogenous in our framework. 

For notational convenience, we reexpress (|3| and ^ as 



exp(E(»j,fc)e^f^ {y^■.j>k (1 - yl:j>k) + (1 - yi:j>k) yf.j>k) ^i,j,k{o)) 



(13a) 



where 

WijA^) = ■ Xij^k, (13b) 

for covariate array x G M^^^^i^ so that Xij^^ is the p- vector of covariates 
associated with i's comparison of j and k, and, as before, Xij^k = s^i.fcj- This 
allows us to model inconsistency in a form semblant of logistic regression. 
Unlike logistic regression for accuracy of pairwise comparisons, this model 
takes into account the dependence between the comparisons that is induced 
by the structure of the sample space. (I.e., that t/j.^^^ A t/j. =^ yi:j>i-) 

Because there are "ties" among the observed interaction frequencies (3(j j ^jg^va?^?/?*;- 
yfk)y while the reported rankings are forced to be complete (no ties were al- 



lowed), there is no configuration of rankings y E y, such that statistic (11 ) is 
— that the reported rankings are completely consistent with those observed. 
This reveals an interesting property of the proposed class of models: because 
the comparisons that are tied in the observed frequencies simply add a con- 
stant to their sufficient statistic, their effect on the likelihood is canceled by 
the normalizing constant. That is, inconsistent comparisons only affect the 
model and the estimation if it is possible for them to be consistent is in the 
sample space. 

For convenience, further let y°^^' be the observed rank of j among those 
with whom i had interacted, with 33 being code for the highest frequency, 1 
being code for the lowest frequency, and ranks for tied alters (i.e., t/^^ = yf^^, 
for k 7^ j) being computed by averaging the "positions" that these alters 
share. 
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Table 3: Effect of frequency of interaction on reporting inaccuracy 





Estimates (Std. Errors) 


Term 


Frequency 


Rank 


Intercept 


-0.0659 (0.0206) 


-0.1582 (0.0247) 


Frequency difference 


-0.0183 (0.0034) 




Frequency rank difference 




-0.0026 (0.0031) 



Coefficients statisticaliy significant at a = 0.05 are bolded. 

Standard errors incorporate ttie uncertainty introduced by approximating of tfie likeiifiood 



using MCMC (Hunter and Handcock 2006) 



5.2.2 Effect of frequency of interaction 

The first question we address is whether the magnitude of the difference in 
the frequency of interaction affects the accuracy. That is, if i's frequency of 
interaction with j differs from i's frequency of interactions with k by more 
than i's frequency of interaction with j differs from i's frequency of interaction 
with /, is i more likely to rank j and k accurately than j and /? 

To answer this, we begin by fitting a simple model with two covariates: 
^i,j,k,i = 1 in the form (13), equivalent to plain inconsistency (11); and 



^i,j,k,2 = IVij ~ yi%\, the absolute difference between the interaction fre- 
quency of i with j and k. We report the results in Table |3j Greater difference 
in interaction frequency of two alters does appear to lead to greater accuracy 
(i.e., lower inaccuracy) in reporting their relative ranks. 

We also fit a similar model where we replace frequency difference with 
frequency rank difference: aJij,fc,2 = IVij^' ~ Vij^']- We find that the effect 
of rank difference is, as expected, negative, but not statistically significantly 
so. This lack of significance may be counterintuitive, but it is, in fact, a con- 
sequence of the constraints imposed by the sample space. Intuitively, given 
that i ranks j and k adjacently, an inaccurate reporting of the pairwise com- 
parison of these alters (e.g., yfj-^k but Vi^kyj) does not entail misreporting 
any other pairwise comparisons, including those involving j or k; but if j and 
k are some d > 1 ranks apart in y'^^ — they have d — 1 other alters between 
them — then inaccurately reporting of the pairwise comparison of j and k en- 
tails inaccurately reporting the pairwise comparisons Vi^j^i and/or Vi^'i^^i for 
every / ranked between j and k. It can be shown easily that any configuration 
y in which y^. j^^j ^ yf"j>k "lust also misreport at least d — 1 such compar- 
isons. Thus, even a model with no rank difference effect and only a baseline 
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inconsistency effect would already heavily penalize inaccurate reporting of 
comparisons between distantly-ranked alters. 



5.2.3 Effect of salience 

The second question that we address is whether the accuracy of reported 
ranking is affected by the positions of those being ranked. Can an ego i 
better discern the ranking of those with whom he or she interacts the most? 
Is he or she more accurate at the extremes? 

To answer this, we fit a model for inconsistency that is a quadratic poly- 



nomial in rank values. More concretely, in the form (13), Wij^kiG) has the 
covariate vector 

= [I, ytr- + yfk^ iytff + ■] , (14) 

inducing a model in which the inconsistency of reported with observed is 
modeled as a second-degree polynomial function of the observed ranks of the 
alters being compared; and this function is symmetric for these two alters. 



The statistics in ( 14 ) represent baseline inconsistency, linear effect of ranks of 
the alters being compared, their quadratic effect, and their interaction effect, 
respectively. 

In fitting this model, we found that it suffers from collinearity, which 
impedes inference, so to improve its numeric conditioning, we fit an equivalent 
model, using rescaled and centered quantiles, evaluating y1'^ - = {yf^-"- — YJ) /2,2 



and substituting for y in (14). 



We report results from this fit in Table |4| All four covariates appear to be 
highly significant. Somewhat surprisingly, the higher-ranked alters appear to 
have slightly higher inconsistency between observed and reported. However, 
the negative coefficient on the quadratic term suggests that the middle ranks 
are reported with the least accuracy of all. We show the predicted incon- 
sistency weight Wij^kiG) as a function of their observed ranks in Figure [3] 
From this, it appears that, indeed, for alters whose observed rankings are 
close together, the accuracy is lowest if the alters are ranked in the middle. 
(For alters whose observed rankings are far apart, the accuracy is higher.) 

6 Conclusion 

Rank-order data are a cornerstone of psychological and sociometric mea- 
surement, but principled treatment of such data in an interpersonal context 
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Figure 1: Estimated coefficients for the cross-sectional model fit to each 
week's rankings in the Newcomb's Fraternity. Error bars are at 95% confi- 
dence. 



Table 4: Effects of alter positions on reporting inaccuracy 



Term 


Estimate (Std. Err.) 


Wald P-value 


Intercept 


-0.0624 (0.0236) 


0.0082 


Linear 


0.0817 (0.0171) 


< 0.0001 


Quadratic 


-1.2991 (0.2938) 


< 0.0001 


Interaction 


1.7604 (0.5107) 


< 0.0006 



Coefficients statistically significant at a = 0.05 are bolded. 

Standard errors incorporate the uncertainty introduced by approximating of the likelihood 
using MCMC ([Hunter and Handcock[ |2006 ). 
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Inconsistency with Prior Week 
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Figure 2: Estimated coefficients for the longitudinal model fit to each week's 
rankings in the Newcomb's Fraternity. Error bars are at 95% confidence. 
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Observed rank of j by i 



Figure 3: Predicted effect of alter positions on reporting inaccuracy 
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poses significant statistical challenges. Here, we have shown how statistical 
exponential families may be used to generalize the now well-known ERGM 
framework to the rank-order case. We have also introduced a corresponding 
set of sufficient statistics that are appropriate for use when only within-ego 
ordinal judgments are psychologically meaningful, a restriction that is im- 
portant when modeling such data. As with conventional ERGMs, a wide 
range of statistics may be posited to capture alternative psychological and 
social mechanisms; the ability to evaluate and compare competing models 
based on such distinct alternatives is one of the strengths of the statistical 
approach. 

It should be noted that, in our development, we focus on the case of 
orderings that are defined from the psychological process in question, rather 
than data that are ordinal simply due to limitations in measurement (e.g., 
count or continuous data observed only as ranks). Although the present 
framework may in some cases be useful for such data (e.g., to avoid having 
to model the count distribution), the assumptions involved, e.g., in the choice 
of sufficient statistics (and their interpretation), may be quite different. 

This work has focused on complete orderings, but the above distinction 
gains further importance when considering partial orderings: the case of 
partial orderings being a property of underlying psychological phenomena 
is substantively different from the case of incomplete orderings arising from 
measurement itself. A well-known example of the latter is a frequently used 
sociometric survey design that asks each ego to rank their top k alters with 
respect to some criterion (liking, interaction frequency, etc.). For instance. 



Sampson (1968) famously asked each of 18 novitiates in a monastery to rank 



their three most liked novitiates among the other 17. That only the top 
three were ranked does not mean that the underlying ordering is partial 
and that the remaining 14 alters are tied. Rather, their ranking relative 
to each other is unobserved by design. Similarly, in 1995-1996, Wave I of 
National Longitudinal Study of Adolescent Health asked high school students 



to list, in order, up to 5 male and up to 5 female friends (Harris et al. 



2003). Again, the students unranked (either because they fell below the 



respondent's threshold for friendship or because the respondent ran out of 
space on the form) are not all substantively tied (i.e., assigned the same rank) 
but are merely unobserved. Cases of unobserved ranking are better handled 
by means of a latent variable framework in which a probability distribution 
is placed on the complete data, and in which likelihood is assessed via the 
marginalization of the complete data conditional on that observed. Such a 
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strategy has been used in the traditional ERGM framework by Handcock 



and Gile (2010), and can be employed here as well. 



In contrast, orderings in which the alters may be substantively tied or 
incomparable with each other — partial orderings — pose greater challenges. 
In the complete ordering case, there exists a distribution of rank-orderings 
that is unambiguously uniform and can thus serve as the baseline distribution 
{reference measure) for the exponential family (Barndorff-Nielsen, 1978, pp. 
115-116, for example). In the partially ordered case, there is no natural 
baseline that is unambiguously uniform. For example, an ego may rank all 
n — 1 alters equally (with only one such "ranking" possible) or he or she may 
assign a distinct rank to each (with {n — 1)\ possible rankings), or anything 
in between. A baseline distribution could, among myriad possibilities, assign 
the same weight to each of the (tt, — 1)! complete orderings as it does to the 
single all-tied partial ordering, or it could assign the same total weight to the 
class of all complete ordering as to the (singleton) class of all-tied orderings. 
Each such baseline is potentially useful in certain situations, and must be 
chosen on substantive grounds. Statistics selected for modeling partial orders 
must further represent not just actors' ranking propensities (relative to the 
chosen baseline) but also their choices not to rank. 

Thus, in this paper, we focused our discussion on complete orderings. De- 
spite our focus on complete orderings, our techniques can (with appropriate 
choice of reference measure) be generalized to the weaker case, however: all 
of the model statistics that we present in Section |3] can be applied to par- 
tially ordered network data without modification, because they only make 
references to the network of interest through the indicator ?/...>. 

Finally, we note that development of ERGMs for rank order data opens 
the way to a rich family of novel statistical models for phenomena such as 
interdependent choice behavior in group context, social influence on pref- 
erences, etc. Particularly because of their suitability for data collected in 
observational settings, rank order ERGMs provide a useful tool for consid- 
ering both new and classic problems in social psychology and the study of 
decision making. 
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Appendix: A simple sampling algorithm for a 
complete ranking ERGM 

We use a Metropolis-Hastings sampling algorithm (Algorithm [T]) to sample 
from a complete ranking ERGM. At every iteration, an ego is chosen at 
random and a symmetric proposal to swap the rankings of two of its alters 
(also chosen at random) is made. 

Algorithm 1 Sampling from a complete rank ERGM 
Let^ 

RandomChoose(y4) return a random element of a set A 
Uniform(a, b) return a random draw from the Uniform(a, b) 
distribution 

Require: y^^^ & y, S sufficiently large, A^, g{-), r){-) 
Ensure: a draw from a complete ranking ERGM Pr0;^ g(-) 
1: for s ^ {1 .. S} do 

2: i ^ RandomChoose(A^) {Select an ego at random.} 

3: j ^ RandomChoose(A^\ {i}) {Propose one alter.} 

4: j' ^ RandomChoose(A^\ {i, j}) {Propose another alter.} 

5: y* ^ (7/(^-1))*' ■'^■^ {Propose a swap.} 

6: r 4- exp{r,{e) ■ {g{y*) - ^(z/^^-^))) ) 

7: u ^ Uniform(0, 1) 

8: if M < r then 

9: ^z^*-* ^ y* {Accept the proposal.} 
10: else 

11: y^^^ ^ {Reject the proposal.} 

12: return y^^^ 
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